@INSIDE EigenScore

mentions 1 type Person feed RSS

04:00

2026-06-03

arxiv.org

large-language-models

Hallucination Is Linearly Decodable from Mid-Layer Hidden States in Quantized LLMs

Researchers found that a linear probe applied to mid-layer hidden states of quantized large language models can detect hallucinations with up to 1.000 AUROC, significantly outperforming sampling-based…

// co-occurs with top 6 entities

Llama-3.1-8B 1 Mistral-7B 1 Qwen2.5-7B 1 TruthfulQA 1 HaluEval-QA 1 FEVER 1